Multilevel and channel-compensated language recognition: ATVS-UAM systems at NIST LRE 2009
نویسندگان
چکیده
This paper presents the systems submitted by ATVS – Biometric Recognition Group at 2009 language recognition evaluation, organized by the National Institute of Standards and Technology of United States (NIST LRE’09). Apart from the huge size of the databases involved, two main factors turn the evaluation into a very difficult task. First, the number of languages to be recognized was the biggest of all NIST LRE campaigns (23 different target languages). Second, the database conditions were strongly variable, with telephone speech coming from both broadcast news, extracted from Voice Of America (VOA) broadcast system, and conversational telephone speech (CTS). ATVS participation consisted of state-of-the-art acoustic and highlevel systems incorporating session variability compensation via Factor Analysis. Moreover, a novel back-end based on anchor models was used in order to fuse individual systems prior to one-vs.-all calibration via logistic regression. Results both in development and evaluation corpora show the robustness and excellent performance for most of the languages (among them, Iberian languages such as Spanish and Portuguese)
منابع مشابه
Data selection and calibration issues in automatic language recognition - investigation with BUT-AGNITIO NIST LRE 2009 system
This paper summarizes the BUT-AGNITIO system for NIST Language Recognition Evaluation 2009. The post-evaluation analysis aimed mainly at improving the quality of the data (fixing language label problems and detecting overlapping speakers in the training and development sets) and investigation of different compositions of the development set. The paper further investigates into JFA-based acousti...
متن کاملFusing language information from diverse data sources for phonotactic language recognition
The baseline approach in building phonotactic language recognition systems is to characterize each language by a single phonotactic model generated from all the available languagespecific training data. When several data sources are available for a given target language, system performance can be improved using language source-dependent phonotactic models. In this case, the common practice is t...
متن کاملDiscriminative acoustic language recognition via channel-compensated GMM statistics
We propose a novel design for acoustic feature-based automatic spoken language recognizers. Our design is inspired by recent advances in text-independent speaker recognition, where intraclass variability is modeled by factor analysis in Gaussian mixture model (GMM) space. We use approximations to GMMlikelihoods which allow variable-length data sequences to be represented as statistics of fixed ...
متن کاملAtvs-uam Nist Sre 2010 System
This paper describes the system submitted by ATVS-UAM to the 2010 edition of NIST Speaker Recognition Evaluation (SRE). Instead of focusing on multiple, complex and heavy systems, our submission is based on a fast, light and efficient single system. Sample development results with English SRE08 data (data used in the previous evaluation in 2008) are 0.53% EER (Equal Error Rate) in tel-tel (tele...
متن کاملBUT language recognition system for NIST 2007 evaluations
This paper describes Brno University of Technology (BUT) system for 2007 NIST Language recognition (LRE) evaluation. The system is a fusion of 4 acoustic and 9 phonotactic subsystems. We have investigated several new topics such as discriminatively trained language models in phonotactic systems, and eigen-channel adaptation in model and feature domain in acoustic systems. We also point out the ...
متن کامل